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ABSTRACT 

We present the results of NICMOS imaging of two massive galaxies photometrically selected to 
have old stellar populations at 2; 2.5. Both galaxies are dominated by apparent disks of old stars, 
although one of them also has a small bulge comprising about 1/3 of the light at rest-frame 4800 A. 
The presence of massive disks of old stars at high redshift means that at least some massive galaxies 
in the early universe have formed directly from the dissipative collapse of a large mass of gas. The 
stars formed in disks like these may have made significant contributions to the stellar populations of 
massive spheroids at the present epoch. 

Subject headings: galaxies: high-redshift — galaxies: formation — galaxies: evolution 



1. INTRODUCTION 

Considerable observational evidence has built up over 
the past few years that a substantial fraction of the mas- 
sive galaxies around us today were already massive at 
very early epochs. This evidence comes primarily from 
three sources: 

• Studies of local massive elliptical galaxies indi- 
cate that the stars in the most massive galaxies 
generally form ed very early and over very short 
time intervals (IPeebled l2002t iThomas et"al1 120051: 



of massive galaxi es were in place at even ear- 
lier epochs (e.g.. IStockton. Canalizo. fc Maiharal 
2004 iMcCarthv et all 120041: Ivan DqkkumI 12004 



iNelan et al.ll2005HRenzinill2ob6[ ). Stars in less mas- 
sive spheroids formed, on average, later and over 
longer time spans. 

• Massive galaxies in clusters show little evidence for 
signif icant evolution up to at least redshift ^ 1 
fe.o., lde Propris et al.ll2007l: fScarlata et al.ll2007l ). 

• Direct observations of massive galaxies at red- 
shifts > 1.5 that are dominated by already old 
stellar populations show that significant numbers 

^ Based in part on observations made with the NASA/ESA Hub- 
ble Space Telescope, obtained at the Space Telescope Science In- 
stitute, which is operated by the Association of Universities for 
Research in Astronomy, Inc., under NASA contract NAS 5-26555. 
These observations are associated with program no. 10418. Addi- 
tional data were collected at the Subaru Telescope, which is op- 
erated by the National Astronomical Observatory of Japan, and 
at the W. M. Keck Observatory, which is operated as a scientific 
partnership among the California Institute of Technology, the Uni- 
versity of California, and the National Aeronautics and Space Ad- 
ministration. The Observatory was made possible by the financial 
support of the W. M. Keck Foundation. 

Also at Cerro Tololo Inter-American Observatory, Casilla 603, 
La Serena, Chile. 
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Although the existence of massive galaxies at high red- 
shifts is now well documented, there have been only a 
few high-resolution studies of their morphologies (e.g. 



Yan fc Thompsonll2003l : lStockton eralll2004 IZirm et all 
20071: iToft et al.ll2007D Morphologies are important, be- 
cause they may well retain signs of the formation history 
of the galaxies. This is particularly true for galaxies that 
show little or no recent star formation, so that we are 
able to observe relatively clean examples of the stellar 
population that formed earliest and that comprises the 
bulk of the mass of the galaxy. In this paper, we present 
deep Hubble Space Telescope (HST) NICMOS imaging of 
two galaxies with virtually pure old stellar populations 
at z ~ 2.5. In § 2, we briefly recount how these galaxies 
were selected. In § 3, we describe the observations and 
reduction procedures. In § 4 and § 5, we analyze model 
fits to the images to determine morphologies, and in § 6 
we discuss the implications of our conclusions. We as- 
sume a flat cosmology with Hq = 73 km s^^ Mpc^^ and 
flM = 0.28. 

2. IDENTIFYING GALAXIES WITH OLD STELLAR 
POPULATIONS AT HIGH REDSHIFTS 

Our procedure for selecting galaxies with old stellar 
popu lations is described in some detail in lStockton et all 
p004f l: here we give a brief synopsis. We observe fields 
of radio sources in certain specific redshift ranges, select- 
ing galaxies with photometric redshifts consistent with 
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that of the radio source. Radio sources generaUy serve 
as beacons for some of the more overdense regions in the 
early universe. Furthermore, the specific redshift ranges 
selected are chosen to optimize discrimination with stan- 
dard filter passbands between old stellar populations and 
highly reddened star-forming galaxies. One of these red- 
shift ranges is 2.3 < z < 2.7, for which the 4000 A break, 
strong in old stellar populatio ns, falls between the J and 
H bands. We have used the iBruzual fc CharlotI (|2003D 
(BC03) spectral synthesis m odels, and, more r e cently , 
preliminary versions of the iCharlot fc Bruzuall (l2007h 
(CB07) models, to evaluate and optimize our photomet- 
ric selection of old stellar populations at various red- 
shifts. The preliminary CB07 models include more realis- 
tic prescriptions f or th ermally pulsing asymptotic-giant- 
branc h stars (iMarigo fc G irardi 2007; see also MarastonI 
|2005() . Although at low redshifts (and for some SEDs at 
high redshifts) the new models can significantly lower the 
masses estimated from i^T-band photometry, at the red- 
shifts we are considering here for nearly pure old stellar 
populations, the masses (and ages) change hardly at all. 
The main effect of using the newer models is to reduce 
the amount of reddening required to obtain a good fit. 

If a stellar population were to have an age of 2 Gyr 
at z = 2.5 (corresponding to all of the stars forming at 
z = 9), its observed colors would be J — K k, 3.0 and 
J — H Ki 2.1. We use a photometric sieve procedure to 
optimize the selection with respect to available observing 
time, first obtaining relatively short J and K' integra- 
tions (typically 5 cr at J = 23 and > 10 ct at i^T' = 20). 
If any objects with J~K' ~ 3 are found, we then ob- 
tain H and deeper J imaging. Finally, for fields with 
objects matching the expected spectral-energy distribu- 
tions of an old stellar population at the redshift of the ra- 
dio source, we attempt to obtain deep imaging at shorter 
wavelengths (usually either R or /) to set constraints on 
any residual star formation. 

Among the galaxies found by this technique are 
one each in the fie lds of the radio galaxy 4C 23.56 
(IStockton et al.ll200 4') and the quasar 4C 05.84. We refer 
to these galaxies as 4C 23.56 ERl and 4C 05.84 ERl; they 
are both luminous objects, and they have stellar popu- 
lations that appear to be overwhelmingly dominated by 
old stars. 

3. OBSERVATIONS AND DATA REDUCTION 
3.1. Ground-Based Optical and Near-IR Observations 

We obtained most of the near-IR obs ervations (J, H, 
and K') with the CISCO IR camer a (iMotohara et~aLl 
I2002D on the 8.2 m Subaru Telescope (jive et al.ll2004D in 
observing runs on 2000 November 8 (UT), 2001 August 
5 and 6, and 2002 May 30- June 1. The images have a 
scale of 0'.'105 pixel"^ and a field of ~ l.'S. In addition, 
we carried out deep i?-band imaging of bo th fields with 
the Echelle Spectrograph and Imager (ESL'S heinis et all 
|2002) on the Keck II Telescope on 2002 August 7. Both 
the IR and optical imaging were reduced according to 
standard procedures using our own IRAF scripts. The 
calibrations used observations of UKIRT Fai nt Standards 
(jHawarden et al.l 120011: iLeggett et aT '2006') for the IR 
photometry and Landolt fields (Landolt 1992) for the 
i?-band imaging. 

We also observed 4C 23.56 ERl at K' with the Subaru 
36-element curvature-sensing adaptive optics (AO) sys- 



tem ()Takami et al.l 120041) and the Infrared Camera and 
Spectrograph (IRCS: lKobavashi et al.l [ 2000l) on 2002 Au- 
gust 1 7. These results were reported bv IStockton et al.] 
(l200l . but we will refer to them again in this paper. 
We used IRCS without the AO system, but with excel- 
lent natural seeing (final images have FWHM of 0'.'35) 
to obtain a very deep image of the 4C 05.84 field in the 
K filter on 2004 August 1. Finally, we obtained J-band 
imaging of the 4C 05.84 field with NIRC2 and the Keck II 
laser-guide-star adaptive-optics system on 2007 August 
21. 

3.2. Hubble Space Telescope NICMOS Observations 

The NICMOS observations used the NIC2 cam- 
era (0'.'075 pixel-i) and the FllOW and F160W fil- 
ters. They were obtained on UT 2005 January 3 
(4C 05.84 ERl, F160W, total exposure 5376 s), 2005 Jan- 
uary 4 (4C23.56ER1, F160W, total exposure 8192 s), 
2005 January 8 (4C05.84ER1, FllOW, total exposure 
8448 s), and 2005 May 16 (4C 23.56 ERl, FllOW, total 
exposure 11264 s) as part of HST program 10418. Af- 
ter doing a first-pass combination of the images to get 
a rough idea of the quality of the data, we went back 
to the calnica processed images and corrected these for 
bias offsets and inverse flatfield effects using the STSDAS 
pedsky task. Most of the FllOW images were obtained in 
orbits impacted by passages through the South Atlantic 
Anomaly (SAA) and needed speci al processing. We used 
the I DL routine saa_clean.pro (|Bergeron fc DickinsonI 
l2003f l to generate an image of the persistence from the 
routinely taken post-SAA dark images and subtract it 
from the science images. Finally, for these images, ped- 
sky was run again to remove any residual bias pedestals. 

We then generated a bad-pixel mask from the data 
quality file, adding an additional mask for the corono- 
graphic occulter, which produces background that is de- 
tected in both filters. Most of the cosmic rays were re- 
moved with t he contributed IRAF procedure lacos-im 
()van Dokkum|[2001 ). At this point the images from the 
individual dither positions were combined onto a sub- 
sampled grid with the STSDAS drizzle task to produce 
the final image. To choose the optimum drizzle parame- 
ters for our purposes, we performed a series of tests with 
artificial PSFs generated by Tiny Tim'^. We ended up 
choosing a drop size of 0.7 and a subsampling factor of 2. 
The final combined images had FWHM of 0'.'133 for the 
F160W images and 0'.'115 for the 4C 23.56 ERl FllOW 
image. We were unfortunately unable to produce a useful 
image of 4C05.84ER1 in the FllOW band because of a 
combination of the object's low surface brightness at that 
wavelength and residual effects on the detector of the pre- 
vious SAA passage. The final 3cr surface brightness limits 
(in the Vega system) were /xigo ~ 22.3 and /ino ~ 23.3 
for 4C 23.56 ERl, and ^iigo « 22.0 for 4C 05. 84 ERl. 

The drizzling process inevitably introduces some level 
of correlation between adjacent pixels. Where we needed 
to estimate absolute errors (such as for our radial-surface- 
brightness plots), we made a statistical correction to 
the error determinations, following the prescriptions of 
'Fruc hter fc Hook! ([2502) 

Although we obtained images of stars for point-spread- 

^ Tiny Tim was written by J. Krist and can be found at 
|http: //www.stsci.edu/software/tinytim/tinytim.html| 
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function (PSF) determination at the ends of some of the 
orbits, PSFs modeled from Tiny Tim were quite consis- 
tent with the stellar profiles. Subtracting TinyTim mod- 
els from the observed stars gave residuals that were less 
than 1.5% (rms) of the peak over the central FWHM 
region (and much lower outside this region), with maxi- 
mum pixel deviations of 3%. The TinyTim profiles also 
have the advantage that they can be generated on a sub- 
sampled grid to minimize interpolation errors in match- 
ing the profiles to the undersampled NICM0S2 images. 
We accordingly used subsampled TinyTim model profiles 
in our analysis. 

4. 4C 23.56 ERl 

In the field of the z = 2.483 radio galaxy 4C 23.56, 
our photometric selection procedure picked out a galaxy 
that had previously been n oted as a very red object by 
iKnopp fc Chambers! ()1997| ). As mentioned above, we 
have already rep orted on our Subaru A O /IRCS imaging 
of 4C 23.56 ERl (jStockton et al.ll2004t there, the galaxy 
is referred to as 4C 23.56 KC68). The main conclusions of 
that paper were that (1) the galaxy indeed has a redshift 
close to that of the radio galaxy, (2) the best fit to the 
photometry is an old (2-3 Gyr) stellar pop ulation with 
little reddening (< 0.2Ay of'Ca lzetti et al.ll2000l extinc- 
tion), and that (3) the morphology of this massive, old 
galaxy looked surprisingly disklike, with a projected axial 
ratio of 0.3 and a Sersic index of 1.5. Because the details 
of the previous observations are a vailable in that paper 
and in a brief follow-up report (jStockton fc McGrathI 
l2007f) . we restrict our discussion here to a comparison 
of the NICMOS imaging with the previous AO imaging. 

The NICMOS FllOW and F160W images are shown in 
Fig. m along with the best-fit Sersic models (convolved 
with the PSF), the residuals from the subtraction of the 
models from the data, and, again, the best-fit models 
(but without convolution with the PSF). We determine 
total magnitudes for 4C 23.56 ERl from the Sersic mod- 
els, finding (on the Vega system) mpuow — 23.39 ±0.17 
and mpifin w = 20.82 ± 0.04 after correction for G alactic 
reddening (jSchlegel. Finkbeiner. fc Davis I I1998D . The 
quoted uncertainties include only sky noise and uncer- 
tainty in the sky level; they do not include any deviations 
between the models and the data (which are in any case 
quite small over the region of good S/N for the data) or 
other potential systematic effects. 

We discuss the F160W image first, since it has a much 
higher S/N ratio than does the FllOW image. Figure [5] 
shows the radial-surface-brightness profile for the F160W 
image of 4C 23.56 ERl, along with the best-fit r^/'*-law, 
expo nential, an d Sersic profiles, determined using gal- 
fit (jPeng et al.l 12002 ). Among these, the Sersic profile 
clearly gives the best fit, as expected, because of the extra 
degree of freedom in the model. The Sersic profile has an 
index n = 1.52 ±0.06, an effective radius re = 0'.'24±0.01, 
and axial ratio b/a = 0.32. The uncertainties in n and r^, 
have been estimated by re-running the models with the 
sky level set la above and below its medi an value. From 
our S ubaru AO imaging in the K' band (jStockton et al.l 
120041) ■ we had obtained n = 1.49, = 0'.'22, b/a = 0.33, 
so the two independent profiles in different bands are in 
remarkably good agreement. Although the AO imaging 
had slightly better FWHM, and the two datasets had 
similar S/N near the center of the galaxy, the NICM0S2 



data extends farther in semi-major axis because of its 
lower sky background. We show a comparison of the 
two profiles in the region of overlap in Fig. [31 Both the 
r^/'^-law and exponential profiles fit the observed profile 
poorly. Adding a small (re = O'.'l), weak (14% of total 
light) bulge component to an exponential profile with an 
re — 0'.'26 gives a fit that is as good as that of the Sersic 
profile within a semi-major axis of 0'.'6 but somewhat 
worse beyond this radius. 

The FllOW image shown in Fig. [U which samples the 
morphology shortward of the 4000 A break (assuming 
that 4C 23.56 ERl has the same redshift as 4C 23.56 it- 
self), superficially has an even more "disky" appearance 
than does the F160W image. This is partly due to the 
sharper PSF at this wavelength: notice that the best-fit 
Sersic models without PSF convolution look much more 
similar than do the models with PSF convolution. Never- 
theless, there may be a detectable difference in morphol- 
ogy in the two bands. The FllOW Sersic model has an in- 
dex n = 1.03 ±0.10 (i.e., essentially a pure exponential), 
an effective radius of 0'.'28 ± 0'.'02,'and b/a = 0.31. The 
radial-surface-brightness profile is shown in Fig. 31 along 
with the best-fit Sersic model and the F160W Sersic 
model (adjusted by a constant magnitude offset to ap- 
proximately match the FllOW points). The observed 
differences are barely significant, given the uncertainties, 
but they seem to indicate a small color gradient, such 
that the outer parts of the galaxy are slightly bluer (at 
least out to a semi-major axis of 0'.'7, at which point the 
uncertainty in the sky background level becomes dom- 
inant). This cannot be a large effect because of the 
tight upper liniits on the R and /-band magnitudes (see 
[Stockton et al.l [20041 ). Nevertheless, it does suggest a 
possible slight decrease in mean age and/or mean metal- 
licity of the stellar population as one progresses from the 
center to the outskirts of the galaxy. 

5. 4C 05.84 ERl 

4C 05.84 ERl was found in the field of the z = 2.323 
quasar 4C05.84 (Fig. [5]). The SED of 4C05.84ER1 is 
shown in Fig. \6\ including photometry from our Spitzer 
IRAC images, which will be discussed elsewhere in more 
detail in the context of a larger sample of objects. While, 
for 4C 23.56 ERl, only upper limits at R and / bands 
have been obtained, for 4C 05.84 ERl we have detections 
at R — 24.6 and / — 23.4, indicating the presence of some 
younger stars. In attempting to fit the observed SED, we 
have explored a range of exponentially decreasing star- 
forming models as well as instantaneous burst models; we 
have also considered models with metallicities of solar, 
0.4 solar, and 2.5 solar. 

The formal best-fit model SED is at a redshift of 2.93, 
substantially higher than that of 4C 05.84 itself. This 
model has a 0.4-solar-metallicity population with an age 
of 900 Myr, an exponential time constant of 100 Myr, 
and a reddening Ay = 0.06 mag. If we restrict ourselves 
to models with redshifts close to that of the quasar, we 
get a reasonable fit with a solar-metallicity model with a 
redshift of 2.40, an age of 1.02 Gyrs, an exponential time 
constant of 200 Myr, and a reddening Ay = 0.58 mag. 
This model fits the /-band and IRAC 5.8 /im photome- 
try less well but the IRAC 7.9 /im photometry slightly 
better. Both of these models are shown in Fig. [51 We 
have no firm grounds for choosing one of these SEDs over 
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Fig. 1.— NICMOS2 images of 4C 23.56 ERl in the FllOW and F160W filters. The best-fit GALFIT Sersic models, convolved with the 
PSF, are shown in the second panel of each row, the difference between the observed images and the models in the third panel, and the 
models without convolution with the PSF in the last panel. Insets show lower-contrast versions of the images. North is up and East to the 
left for this and all following images. 
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Fig. 2.— Radial-surface-brightness profile of the NICMOS2 
F160W image of 4C 23.56 ERl, with best-fit r^/'^-lam, exponen- 
tial, and Sersic profiles shown. The upper panel shows the profiles, 
and the lower panel shows the deviations of the observed profile 
and the two other models from the best-fit Sersic profile. Sample 
points in this and subsequent plots are at intervals of 1 subsampled 
pixel in the drizzled images (0'.'038) along the major axis, so data 
values and errors for adjacent points are fairly strongly correlated 
because of drizzling, PSF smearing, and compression of the scale 
along the minor axis. 



the other; but, given the uncertainties in the models and 
possible star-formation histories, we will accept for the 
remainder of this paper that the redshift closer to that 
of 4C 05.84 itself is the correct one. In either case, we 
are dealing with a massive galaxy comprising stars that 
mostly formed ^ 1 Gyr before the observed epoch. 

We show our NICM0S2 F160W image of 4C 05. 84 ERl 
in Fig. [3 along with our best-fit GALFIT model. We have 
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Fig. 3. — Comparison of normalized differential surface- 
brightness profiles for 4C 23.56 ERl in the region of overlap. All 
profiles are shown relative to the NICMOS2 F160W Sersic fit, and 
the difference between F160W and K' magnitudes has been re- 
moved via a simple slide fit. The red points and line show the 
F160W profile and Sersic fit, respectively, and the blue points and 
line show the Subaru AO K' profile and fit. 

tried a series of models, including, again, r^/^-law, ex- 
ponential, and Sersic. Radial-surface-brightness profiles 
of these are shown in the left panel of Fig. [H In this 
case, even the Sersic profile is not a particularly good fit. 
We get a significantly better fit with a two-component 
model incorporating a small r^/^-law bulge comprising 
31 ± 15% of the light and an exponential disk account- 
ing for the rest. This model is compared with the best 
Sersic profile fit in the right panel of Fig. [5] For the 
two-component model, the disk component has an effec- 
tive radius rg = 0'.'89 ± 0'.'09, and the bulge component 



5 




a (arcsec) 

Fig. 4.— Radial-surface-brightness profile of the NICMOS2 
FllOW image of 4C 23.56 ERl, with best-fit Sersic profile shown 
(red trace). Also shown is the best-fit F160W Sersic profile from 
Fig. [2] shifted by a constant magnitude offset to match the FllOW 
points (blue trace). The upper panel shows the profiles, and the 
lower panel shows the deviations of the observed FllOW profile 
and the F160W model from the best-fit Sersic profile to the FllOW 
data. 




Fig. 5.— The field of 4C 05.84 ERl. The image is a deep (6750 
s) ^ST-band integration with the Subaru Infrared Camera and Spec- 
trograph in 0'.'35 seeing. North is up and East to the left. 

has Te = 0'.'37 ± 0'.'2. This best-fit model gives a total 
magnitude (on the Vega system) mpi^ow = 20.28. 

There is some evidence from our R and /-band imaging 
and our recent Keck AO imaging in the J band that the 
bulge component virtually disappears at these shorter 
wavelengths, indicating that the two morphological com- 
ponents have different stellar populations. Such a result 
that would not be surprising. The SED shown in Fig. [6] 
would then be the linear combination of the two SEDs, 
with the bulge likely being a nearly pure old popula- 
tion and with younger stars being confined to the disk 
component. We stress, however, that even the disk com- 
ponent must be dominated by old few hundred Myr) 
stars, with little very recent star formation. We have 
experimented with a range of combinations of SEDs at 
the quasar redshift, but none with simple star- formation 
histories (instantaneous bursts or exponentially decaying 
bursts) gave a significantly better fit than did the single- 
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Fig. 6.— Spectral-energy distribution for 4C 05.84 ERl. The 
blue curve shows the best-fit CB07 stellar population model to the 
photometry, at a redshift z = 2.93. The red curve shows the best- 
fit model with a redshift close to that of 4C 05.84 itself (zq). The 
redshifts z, Calzetti-law extinctions Ay, and ages of the stellar 
populations are indicated for each model. See the text for details. 

population SEDs shown in Fig. [HI We will explore this 
possibility in more detail elsewhere. 

6. DISCUSSION 

Table [1] summarizes the parameters for the two 
galaxies. The morphologies of both 4C 23.56 ERl and 
4C 05.84 ERl appear to be dominated by disks of old 
stars. However, the disks are quite different in scale. 
4C 05.84 ERl, at least, also appears to have a small bulge 
comprising about 1/3 of the total light in the F160W fil- 
ter (~ 4800 A, rest frame). We cannot exclude the pos- 
sibility that 4C 23.56 ERl also has a weak bulge, with up 
to - 15% of the total hght in the F160W filter; indeed, 
if the slight apparent difference in morphology between 
the F160W and FllOW images is real, such a difference 
would seem to favor this possibility. 

But it is the presence of massive, old disks that 
continues to give the strongest constraint on formation 
mechanisms. Such disks also have been seen at redshifts 
1.5, where no rmal ellip t icals with r^'^^-law profiles 
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It is difficult to imagine that 
these massive disks could have formed via any process 
other than the dissipative collapse of a large cloud 
of gas. Such disks are also unlikely to have survived 
major merging events, although the bulge component in 
4C 05.84 ERl may testify to either some level of minor 
merging activity or bulge building via disk instabilities. 

For galaxies at z ~ 2.5, the evidence for a domi- 
nant old stellar population depends on the inflection 
in the SED shortward of the H band, and establish- 
ing this inflection with optical/near-IR photometry de- 
pends on the relatively short baseline from the H to 
the K band. Furthermore, at the present epoch, es- 
sentially all strongly disk-dominated galaxies show ev- 
idence for continued star formation. It is therefore not 
too surprising that claims of passive disks at h igh redshift 
should be doubted (e.g., iPierini et al.ll2005l ). However, 
as Fig. [5] shows, Spitzer IRAC data is entirely consis- 
tent with the SED of a moderately old stellar population, 
and no plausible SED incorporating very recent star for- 
mation combined with dust would fit the observed pho- 
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Fig. 7.— The NICMOS2 image of 4C 05.84 ERl in tlie F160W filter (left panels). The best-fit GALFIT composite ■rl/'*-law + exponential 
model, convolved with the PSF, is shown in the second panel of each row, the difference between the observed images and the model in 
the third panel, and the model without convolution with the PSF in the last panel. This last panel is divided into 3 sub-panels: the 
middle one of these shows the composite model, the left one shows the exponential component alone, and the right one shows the r^/^-law 
subcomponent alone. The lower panels show lower-contrast images, except for that for the residual image, which shows a slightly smoothed 
high-contrast version of the image. 
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Fig. 8.— Radial-surface-brightness profile of the NICMOS2 F160W image of 4C 05.84 ERl. In the left panel, the best-fit rl/-*-law, 
exponential, and Sersic profiles are shown in the same way as in Fig. [2] In the right panel, a composite model consisting of a small r^/^-law 
bulge, accounting for 31% of the light, and an exponential disk comprising the remainder, is compared with the best-fit Sersic profile. In 
this case, the differentials in the bottom part of the panel are made with respect to the composite model. 



tometry. We have recently also obtained IRAC imaging 
of the field of 4C 23.56, and our analysis of these data 
shows that the IRAC photometry falls squarely on our 
best-fit solar-metallicity BC03 model determined from 
the optical/near-IR photometry alone: an instantaneous 
burst with an age of 2.6 G yr and an extinction Ay = 0.16 
mag (jStockton fc McGra th 2007) . Using the more recent 
preliminary CB07 models, with their improved treatment 
of AGB stars, wc obtain a stellar population age of 2.8 
Gyr with Ay = 0. Again, no plausible model with signif- 



icant star formation and reddening would fit these data. 

Masses for these galaxies can be estimated from 
the mod e l fits . Assuming solar metallicities and a 
IChabrieJ (|2003f l initial mass function, we obtain a mass 
of 3.9 X 10"Mo for 4C 23.56 ERl and 3.3 x 10" Af© 
for 4C 05.84 ERl (assuming the model at z = 2.4 with 
Av = 0.58). 

While the stellar-population age of 4C 05. 84 ERl in- 
dicates that the last major star-formation episode oc- 
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TABLE 1 

Model Parameters for 4C 23.56 ERl and 4C 05.84 ERl 



Galaxy 


Filter 


Sersic n 


re 

{") 


(kpc) 


4C 23.56 ERl 
4C 23.56 ERl 

4C 05.84 ERl 


FllOW 
F160W 

F160W 


1.03 ±0.10 
1.52 ±0.06 

l.OO'' 

4.00=^ 


0'.'28 ± 0'.'02 
0'.'24±0'.'01 
0'.'89 ± 0'.'09 
0'.'37 ± 0.20 


2.2 ±0.2 
1.9±0.1 
7.1 ±0.8 
3.0 ± 1.6 



^ The Sersic indices for the two model components for 4C 05.84 ERl 
have been fixed at these values, which correspond to exponential 
and r^/^-law profiles, respectively. 



curred at z 3.7, when the universe was ~ 1.8 Gyr 
old, 4C 23.56 ERl has a stellar-population age that is 
formally slightly greater than the age of the universe at 
z — 2.483. Clearly, the likely errors in the age determina- 
tion and the usual caveats regarding the age-metallicity 
degeneracy mitigate any implied paradox. Nevertheless, 
this massive galaxy must have formed at a very high red- 
shift. Models with [Fe/H] = +0.4 give an age of 1.9 Gyr, 
but with a significantly worse fit. 

It therefore seems likely that galaxy formation mod- 
els will have to allow for the presence of early-forming 
massive disks. This means that, at least in some dense 
regions, it has been possible to form ~ 3 x 10^^ Mq of 
stars within a relatively short time via dissipative col- 
lapse and without the aid of major mergers. While our 
selection criteria have ensured that the galaxies we have 
discussed here comprise essentially pure old stellar popu- 
lations, they may well be representative of many massive 
galaxies at high redshift, most of which would not be in 
our sample if they retained even tiny amounts of resid- 
ual star formation or if they had had any significant star 
formation within a few hundred Myr prior to the epoch 
at which we observe them. 

4C 05.84 ERl has a luminosity and an effective radius 
that are similar to those of many local galaxies. Our best- 
fitting Sersic model has rg — 6.3 kpc. For comparison, 
for galaxies of similar m ass from the Sloan survey with 
Sersic n < 2.5, IShen et a l. (2003) find = 7.2tli kpc. 
This galaxy could become, with passive evolution and 
perhaps a few minor mergers to increase the bulge-to- 
disk ratio somewhat, a typical SO galaxy at the present 
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epoch. On the other hand, we do not see galaxies like 
4 C 23.56 ERl at the present epoch. By the prescription 
of IShen et al] (l2003l) . a low-S ersic-index galaxy with the 
mass of 4C 23.56 ERl would have Ve = T-QlH kpc, but 
4C 23.56 ERl actually has re = 1.9±0.1 kpc. this means 
that the stellar mass surface density is much higher than 
for local galaxies, a result that has also b een found for 
other samples of distant red galaxies (e.;?., iTruiillo et alj 
2006; Toft ct al. 2007). It would seem that the only hkely 
path for such galaxies to evolve to objects consistent with 
the local population of galaxies is through dissipationless 
mergers. 

There is recent evidence that the most massive galax- 
ies in the local universe are likely the result of dry 
mergers of galaxies with stars that are already ol d 
and with very little gas (e.g., iBernardi et al.l 120071 ). 
With the constraint that these merging components 
must themselves mostly be fairly massive (to avoid a 
large dispersion and flattening in the observed color — 
magnitude relation for present-day m assive galaxies, e.g., 
iBower. Kodama. fc Terlevichl [TMl), it seems possible 
that these early massive disks may well be among the 
sources for the old stars that today are found in the most 
massive elliptical galaxies. 
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